Inventi Impact: Audio, Speech & Music Processing

Articles

Inventi:easm/17430/15

ViSQOL: An Objective Speech Quality Model

01-Jan-1970 Research 2015 : October - December

Andrew Hines, Jan Skoglund, Anil C Kokaram, Naomi Harte

This paper presents an objective speech quality model, ViSQOL, the Virtual Speech Quality Objective Listener. It is a\nsignal-based, full-reference, intrusive metric that models human speech quality perception using a spectro-temporal\nmeasure of similarity between a reference and a test speech signal. The metric has been particularly designed to be\nrobust for quality issues associated with Voice over IP (VoIP) transmission. This paper describes the algorithm and\ncompares the quality predictions with the ITU-T standard metrics PESQ and POLQA for common problems in VoIP:\nclock drift, associated time warping, and playout delays. The results indicate that ViSQOL and POLQA significantly\noutperform PESQ, with ViSQOL competing well with POLQA. An extensive benchmarking against PESQ, POLQA, and\nsimpler distance metrics using three speech corpora (NOIZEUS and E4 and the ITU-T P.Sup. 23 database) is also\npresented. These experiments benchmark the performance for a wide range of quality impairments, including VoIP\ndegradations, a variety of background noise types, speech enhancement methods, and SNR levels. The results and\nsubsequent analysis show that both ViSQOL and POLQA have some performance weaknesses and under-predict\nperceived quality in certain VoIP conditions. Both have a wider application and robustness to conditions than PESQ or\nmore trivial distance metrics. ViSQOL is shown to offer a useful alternative to POLQA in predicting speech quality in\nVoIP scenarios.

How to Cite this Article
CC Compliant Citation: Hines et al., ViSQOL: an objective speech\nquality model, ournal on Audio, Speech, and Music Processing\n(2015) 2015:13, DOI 10.1186/s13636-015-0054-9, (http://\ncreativecommons.org/licenses/by/4.0).
Download Full Text

Call Us: +4 (800) 888-0008

Inventi Impact: Audio, Speech & Music Processing

Articles

Inventi:easm/17430/15

ViSQOL: An Objective Speech Quality Model

How to Cite this Article

Links

Contact Us